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Nucleic add mutation assays 
FIELD OF THE INVENTION 

5 The present invention relates to the field of biochemistry and 

molecular biology, and is especially concerned with assays of nucleic 
acid for detecting and/or locating mutations therein, especially 
point nutations or insertion/deletion nutations involving the bases 
of just a few sequential nucleotides in, for exanple, the DNA of 
10 genes coding for proteins. Particularly in connection with this last 
aspect, the invention can have useful practical applications in 
biotechnology and medical diagnostics . 

BACKGROUND 

15 

As is caiman knowledge, nutations can arise in sections of the 
nucleotide strands of genomic DNA either by deletion or insertion of 
part or all of a nucleotide base sequence or by an alteration of one 
or more nucleotide bases . Transcription from such mutated DNA 
20 sections may Chen lead to defective protein products. Such defective 
protein products can be a cause of many genetic diseases or 
disorders. Such defective protein products that arise in 
fermentation processes in biotechnology may also be dysfunctional or 
harmful . 

25 

The ability to detect nutations in coding and non- coding DNA. 
is inportant for the diagnosis of inherited disease. Nucleotide 
changes in a normal (i.e. wild- type) gene sequence are called gene 
mutations and can be either harmful or harmless. For exairple, a 

30 harmful gene mutation can be a single base pair change in a gene 
encoding an essential protein. A single base pair change or small 
insertion or deletion can result in a frame shift, a stop codon, or a 
nan- conservative amino acid substitution, each of which can result in 
an inactive protein. A harmless gene mutation cam be a gene 

35 polymorphism which results in a protein product with no detectable 
change in normal function. Mutation in non-coding DNA can also lead 
to disease, as in, for example, nutations in non-coding splice sites 
(found in certain cases of cystic fibrosis disease for instance) or 
mutations in transcriptional regulatory elements (found in certain 
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defects of £-globin genes) . 

It is possible to form four sets of nucleotide mismatches when 
a mutant and normal DNA segment are annealed together. These sets 
5 include: G:A/C:T, C:C/G:G, A:A/T:T, and C:A/G:T. Each nucleotide 
pair represents eight possible single base pair mismatches which 
could be found in a ENA heteroduplex. However, DNA:RNA and RNA:RNA 
heteroduplexes can also be formed. Where a heteroduplex includes 
RNA, 9 single base pair mismatch sets are possible. DNA: DNA, ENA:RNA 
10 and RNA:RNA heteroduplexes can also be created by insertion or 
deletion of nucleotides in the mutant nucleic acid strand. 

One exarrple of a harmful mutation is provided by the well- 
known case of sidde cell anaemia where a point nutation involving an 

15 alteration of but a single nucleotide base, namely the substitution 
of a specific adenine by thymine in the genomic DNA, is responsible 
for the defect. In the case of cystic fibrosis the disease can arise 
from the presence of any one of a nunber of possible point nutations 
or small insertions or deletions that have been identified in 

20 different parts of the cystic fibrosis transmembrane regulator (CFTR) 
gene. Mutations in genomic DNA encoding oncogenes and tumour 
suppressors are also believed to be responsible for cell 
proliferation that causes many cancers. 

25 Various methods of testing and detecting nutations in nucleic 

acids are known, many of which use for eranple a preliminary stage of 
polyrrerase chain reaction (POO amplification. Mary of the existing 
nethods are limited to cases where the precise nature and location of 
the nutation or molecular change being saight is already known and/or 

30 is of a particular kind. However, in many instances of disease- 
causing mutations the precise nature and location of the mutations 
are not known. A number of known methods of detecting unknown 
mutations in nucleic acids, such as SSCP, heteroduplex analysis, 
RNAse protection and chemical cleavage of heteroduplexes, are 

35 discussed in a review article by Markus Grompe entitled "The rapid 
detection of unknown mutations in nucleic acids" published 1993 in 
Nature Genetics, 5, 111- 117. As yet there is no universal ideal 
method available and there is a need for more sensitive, rapid and 
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efficient methods of detecting mutations in DNA. Such methods should 
also be capable of locating in a target nucleic acid or gene the 
position of point mutations or small mutations involving only a few 
bases . 

5 

When a nutation occurs in normal double stranded DNA of a 
living organism, initially this will generally affect one strand only 
of the duplex molecule, causing a mismatch in the base' pairing. For 
example, with the occurrence of a point mutation a nucleotide 

10 cytosine base (C) in one strand may be changed so that the 
complementary guanine base (6) in the other strand becomes opposed to 
an adenine (A) or to a thymine base (T) , producing a base-pair 
mismatch in the duplex molecule. However, certain proteins and 
enzymes, referred to as "proof-reading" proteins or enzymes or "DNA 

15 mismatch repair proteins or enzymes", are usually present in most 
living organisms and these enzymes act to detect such base -pair 
mismatches and to initiate a repair process in the mutated region 
before the molecules replicate and pass on the defect to subsequent 
copies of the DNA. One example of a set of such mismatch "repair 

2Q enzymes", believed to be present in all living organisms, is provided 
by the Mut series of proteins and homologues thereof. A very well 
characterised system of these Mut repair enzymes, occurring for 
example in the bacterium E.coli, has been described by Paul Modrich 
and colleagues (for Review article, see for example Modrich, P. 

25 (1991), Annual Rev. Genet., 25, 229-253), a set of three proteins 
having been identified and purified which are termed MutS, MutH and 
MutL. These detect errors in DNA replication by interacting with 
double-stranded nucleic acid molecules containing mismatched base 
pairs that arise when errors and new nutations occur. The ENA-repair 

30 protein MutS in particular is a highly conserved protein which has 
the ability to detect and bind to the sites of mismatched bases 
(other than C:C) or deletions or insertions of up to four bases (see 
for example paper by Shin- San Su and Paul Modrich, "Escherichia coli 
iiutS- encoded protein binds to mismatched ENA base pairs" (1986) Proc. 

35 Natl. Acad. Sex. U.S.A., 83, 5057-5061) . MutS then recruits the MutH 
and MutL enzymes to create a nick at a CATG sequence near the 
nutation. Other enzymes then repair that region between the nutation 
and the nick (see also R.S. lahue et al, "ENA Mismatch Correction in a 
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Defined System" (1989) Science 245, 160-164) . 

The ability of various known mismatch repair enzymes to seek 
out and interact with mismatch regions in duplex DNA molecules has 
5 already led to some proposals for using such enzymes in assays to 
detect mutations responsible for such mismatches. Thus, in WO 
93/02216 (Upstate Biotechnology, Inc., Lake Placid, New York, 
U.S.A.) a method for detecting nutations such as a single base change 
or an addition or deletion of about one to four base pairs in duplex 

10 nucleic acid molecules or polynucleotides is disclosed which is based 
on the use of a DNA mismatch-binding protein such as MutS in 
conjunction with a method using antibody reagents or the like for 
specifically recognizing or detecting the presence of said protein 
bound to nucleic acid or polynucleotide molecules. WO 93/20233 

15 (University of Maryland at Baltimore) discloses a method for 
detecting single base pair mismatches at a preselected site in 
nucleic acids such as genomic DNA which is also based on the use of 
enzymes that repair mismatches in nucleic acids. In this latter 
method, however, most enphasis is given to the detection of base pair 

20 mismatches at particular sites using mismatch repair enzymes having 
endonuclease activity that specifically cleave nucleic acid strands 
near to mismatches . Neither of the methods disclosed in the two 
above-mentioned patent publications, however, appear to be well- 
suited for determining the location of unknown nutations in nucleic 

25 acids, and in WO 93/02216 in particular the method appears to be 
concerned specifically with determining whether a particular sample 
of DNA includes a mismatch nutation without consideration of locating 
any such mutation that may be detected. 

30 in a technique developed by Schmitz and Galas (Schmitz, A, and Galas, 
D.J. (1978) DNasel f out-pri nting: A sinple method for the detection of 
protein-DNA binding specificity." Nucleic Acids Research, 5, 3157- 
3170] for the study of sequence- specific binding of proteins to ENA, 
a DNA fragment is exposed to a sequence-specific DNA binding protein. 

35 After such tine as to allow for binding, the protein-DNA cortplex is 
treated with DNasel. The bound protein shields that region of DNA to 
which it is bound from digestion by DNasel, and after separation of 
the reaction products by gel electrophoresis, the protected region is 
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seen as a gap in the otherwise continuous background of digestion 
products . 

Footprint analysis has also been accomplished using DNA 
5 digesting enzymes which are processive, i.e. act on the DNA in a 3' 
to 5 r fashion. The prototype enzyme which has been used for this type 
of assay is Exonuclease III. The procedure for such exonuclease 
footprinting involves binding of sequence specific DNA binding 
protein to the DNA followed by exonuclease digestion. As with the 

!0 above-mentioned technique using DNase, the bound protein protects 
the DNA from digestion. However, due to the processive nature of the 
exonuclease, the reaction products do not consist of a background of 
randomly cleaved DNA fragments, rather they consist of two single 
stranded species which overlap and form double standed DNA only in 

15 the region of the ENA-protein interaction. Thus, after separation by 
gel electrophoresis, the region of DNA-protein interaction may be 
deduced by the lengths of the two single stranded reaction products. 

It is to be noted, however, that in none of the afore- 
20 mentioned prior art has there been any disclosure of a footprinting 
or similar technique involving endonucleases or exonucleases being 
used to detect mutations in DNA. 


SUMMARY OF THE INVENTION 

25 

Insofar as the present invention in its main aspect concerns 
assays to detect nutations or base mismatches occurring in nucleic 
acid molecules, it is based on a principle of forming heteroduplex 
nucleic acid ttolecules in which strands derived from an original 

30 nucleic acid test sanple are paired with non-mutant strands from a 
corresponding reference sanple. Said reference sample may, for 
example, be provided by the wild- type non-mutated nucleic acid or 
derivative thereof, for example a cloned DNA or amplified PCR 
derivative or an equivalent synthetic oligonucleotide. A mismatch- 

35 binding protein or repair enzyme, such as the MutS protein or 
hcnologues thereof for exanple, or other mismatch-binding protein (s) 
as herein disclosed, is then used to seek out and bind to any relevant 
mismatch site in the heteroduplex molecule resulting from point 
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nutations or fran insertions or deletions of a few sequential nucleo- 
tide bases in the strands from the test sanple. Upon then digesting 
the reaction mixture with an exonuclease all the nucleic acid 
molecules without any mismatches or small insertions or deletions are 
digested and completely removed. Heteroduplex molecules containing 
mismatches or small insertions or deletions will be detectable 
because the bound protein acts to block the exonuclease and protect 
the underlying strands from its action. Thus, residual single strand 
fragments will only be derived from the mismatch-containing hetero- 
duplex molecules. Analysis of these single strand fragments can then 
enable the locus of the mismatched bases to be determined. 

In a particularly preferred procedure the DMA protected by the 
bound mismatch-binding protein or repair enzyme is detected by 
digesting the nucleic acid reaction mixture with an exonuclease 
enzyme having a specific unidirectional exonuclease activity, under 
conditions in which the exonuclease enzyme progressively removes or 
deletes nucleotides from one end of each nucleic acid strand until 
reaching the site of a base-pair mismatch bound to the mismatch- 
binding protein or repair enzyme. The latter then blocks and 
protects against further exonuclease digestion along the strand. For 
such heteroduplex UNA. molecules confining a nutation and mismatch of 
bases there is therefore no conplete degradation by the exonuclease 
enzyme, and the presence of incompletely degraded relatively large 
DNA fragments can then be detected by various methods. By then 
determining the size of the fragments the location of the mismatch 
region and of the mutation may also be ascertained. 

Thus, according to one aspect the invention provides a method 
for detecting a mutation in a test sairple of target nucleic acid, 
said method comprising 

a) providing a control sample of reference nucleic acid 
capable of hybridizing to strands of the target nucleic 
acid from said test sample; 

b) denaturing if necessary said reference nucleic acid and 
said target nucleic acid, or replicates thereof, forming 
single-stranded reference nucleic acid and single- 
stranded target nucleic acid; 
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annealing said single- stranded reference nucleic acid 
and said single-stranded target nucleic acid, wherein 
said annealing is sufficient to form heteroduplex 
nucleic acid corposed of one strand derived from said 
target nucleic acid hybridized with a second strand 
derived from said reference nucleic acid; 
contacting said heteroduplex nucleic acid with a 
mismatch-binding protein capable of binding selectively 
to the site of said mutation; 

treating said annealed nucleic acid material with an 
exonuclease, the presence of said mismatch-binding 
protein being sufficient to protect said nucleic acid to 
which it is bound from digestion by said exoixuclease; 
and 

detecting the presence of said protected nucleic acid as 
a indication of the presence of said mutation in said 
target nucleic acid. 

In preferred embodiments the method further comprises 

20 determining the length of residual strand fragments of said protected 
nucleic acid, giving an indication of the location of said nutation. 
The determination of the length of such residual strand fragments, or 
size resolution and analysis thereof, is preferably carried out by 
gel electrophoresis, e.g. polyacrylamide gel electrophoresis (PftEE) , 

25 of the denatured nucleic acid exonuclease digestion products whereby 
the nucleic acid fragments are separated according to their relative 
size. For detecting and visualising the size separated fragments in 
the gel a suitable label (e.g. a fluorescent dye or radioactive 
agent) may be incorporated at an earlier stage into the reference 

30 nucleic acid molecules, or suitable labelled probes and Southern 
blotting for example may be used where sufficient sequence 
information is already available in respect of the expected fragments 
from particular nutations. Other alternative size separation methods 
or techniques, however, may also be used if desired, including for 

35 exanple capillary electrophoresis, size exclusion chromatography, 
high performance liquid chromatography (HPLC) and thin layer 
chromatography (TLC) . 
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The invention also provides a method for detecting single base 
mismatches or mutations in nucleic acid comprising the steps of 

(a) generating heteroduplex nucleic acid molecules in which 
single strands derived from a sample of the target nucleic 
acid under test are hybridized with corresponding non-mutant 
single strands derived from a reference nucleic acid sanple 
whereby mutant strands present in the original target nucleic 
acid test sanple become paired with non-mutant strands from 
the reference nucleic acid sanple to form mismatch-containing 
heteroduplex nucleic acid molecules; 

(b) contacting the nucleic acid material generated in step (a) 
with a mismatch-binding protein effective to seek out and 
selectively bind to mismatch sites in the heteroduplex 
molecules; 

(c) digesting the nucleic acid material from step (b) with an 
enzyme having exonuclease activity whereby said enzyme 
pr ogressively removes or deletes nucleotides from one end of 
each strand of the heteroduplex molecules until encountering 
said mismatch-binding protein bound to a mismatch site, said 
protein then acting to block further deletions and strand 
degradation by the exonuclease enzyme; and 

(d) analysing the nucleic acid digestion products from step (c) to 
determine the size or sizes of remaining single stranded 
nucleic acid fragments. 

The presently preferred mismatch-binding protein for use in 
carrying out the invention is a MutS encoded protein such as the 
commercially available K.coli MutS protein from the Mut repair 
protein series. This is effective, albeit with somewhat different 
affinities, to bind to mismatch regions produced by most of the 
different possible base mismatch pairings within a range of one to 
about four nucleotides. Other mismatch-binding proteins or repair 
enzymes, however, may also be used. These may be similar to or 
homologues of the above-mentioned MutS protein and will have 
equivalent characteristics insofar as the detection and binding to 
mismatch regions of interest in heteroduplex DNA molecules is 
concerned and ability to protect nucleic acid strands to which they 
are bound against exonuclease digestion. Such other mismatch-binding 


WO 95/29258 


PCT/GB95/00964 


9 

proteins include, for example, HexA from Streptococcus pneumoniae, 
MutS from Salmonella typhimurium, Mats from A. Vinlandii, MSH 1 from 
Saccharccnyces cerevisiae, HMSH2 from humans, NMSH2 from mouse, XMSH2 
from Xenopus, and pl60 from humans 

5 

Various exonucleases or DNA polymerases containing 
exonuclease activity may be used for the digestion stage, including 
exonuclease III, T4 DNA polymerase, Vent (Pol-ve) polymerase, 
bacteriophage lambda exonuclease and T7 DNA polymerase. The 
10 presently preferred enzyme, hcwever, is T7 DNA. polymerase which has a 
specific processive 3' to 5" exonuclease activity. 

In carrying out the invention, the original target nucleic 
acid of the test ' sample and also the reference nucleic acid will 
15 usually first be subjected to amplification either by cloning or, 
more conveniently, by using a conventional PGR amplification 
technique with appropriate primers and a high fidelity Taq 
polymerase . 

20 In preferred embodiments, the reaction mixture containing the 

heteroduplex nucleic acid molecules is treated with the mismatch 
binding protein such as the MutS protein under slightly alkaline 
conditions, most preferably in the pH range of 8.0 to 8.5, and in the 
presence of a divalent cation such as Mg ++ which should normally be 

25 at a concentration of at least 7nM, and preferably at a concentration 
of about 8rrM. The terqperature, however, at which the treatment with 
the mismatch binding protein is carried out is not usually critical, 
at least not with MutS, and may be within the range of about 0°C to 
about 30°C for example. 

30 

For marketing and practical use, all the basic essential 
materials and reagents required for carrying out the detection method 
of the present invention may be assembled together in a self- 
contained kit. Such kits thus provide another aspect of the 
35 invention. Such kits will include one or more mismatch-binding 
proteins or repair enzymes capable of seeking out and binding to 
mismatch sites in heteroduplex molecules and an enzyme having 
specific unidirectional exonuclease activity, each reagent being in 
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a separate container comprised within the kit, together with 
appropriate buffer and instructions for use. Assuming optional PGR 
anplif ication is to be used such kits may also include preselected 
primers, preferably detectably labelled, for PCR artplif ication of a 
5 particular fragment of a specific nucleic acid sequence or gene, Taq 
polymerase enzyme, a mix of deaxynucleo tides and buffers to provide 
the necessary reaction mixture for carrying out selective PCR 
amplification . 

10 The invention also provides a method for screening or testing 

human or animal subjects for the presence of a suspected genetic 
defect or mutated gene related disease, said method comprising 
obtaining nucleic acid from said subject and using that nucleic acid 
as the target nucleic acid in carrying out a method of detecting and 

15 localising a mutation as herein disclosed. 

The original target nucleic acid of the test sample will 
generally comprise double-stranded genomic or cloned DNA or may be 
cDNA derived possibly from mRNA. The reference nucleic acid 

2 Q providing a control sample will also frequently be provided by a 
fragment of double-stranded DNA, particularly non-mutated wild-type 
genomic DNA, or cloned DMA or cDNA, or even a synthetic 
oligonucleotide, all of which varieties are to be regarded as being 
covered by, and included within, the term "nucleic acid" as used 

25 herein. 

Hie invention, and an exanple of the manner in which it may be 
carried out in practice to detect mutations in genomic DNA, will now 
be described in more detail with reference to the accompanying 
3q drawings . 


BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is an annotated diagram illustrating the general 
principle involved in using a MutS DNA mismatch-binding protein or 
repair enzyne to detect and bind to a nucleic acid duplex molecule at 
the location of a base pair mismatch, and in then digesting the 
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nucleic acid with an enzyme having a specific 3' -5' exonuclease 
activity in order to enable the MutS binding subsequently to be 
detected and located by resolving and separating the residual single 
stranded fragments according to size using, for example, denaturing 
gel electrophoresis; 

FIGURE 2 illustrates the nucleotide base sequence (SBQ ID ND: 
1) of a fragment of the human cystic fibrosis transmembrane regulator 
(CFIR) gene containing, between flanking sequences, the CFIR exnn 11 
(GenBank accession number M55116) in which the locations of four 
known but different alternative point mutations, designated 1717-1 
G>A, S549N, G551D and R553X respectively, are indicated; 

FIGURE 3 sjtiows the base sequence of fluorescent labelled 
synthetic oligonucleotide primers used for PCR airplif icatian of a 492 
base pair (bp) sequence of the genomic DNA fragment depicted in 
FIGURE 2, (a) being the sequence (SBQ ID ND: 2) of one primer that is 
labelled with a blue fluorescent dye (designated FAM) and (b) being 
the sequence (SEQ ID NO: 3) of the second primer which is 
complementary to the sequence at the other end of the selected 492bp 
fragment and which carries a label consisting of a green fluorescent 
dye (designated JOE) ; 

FIGURE 4 illustrates the detection of the 1717-1 G>A nutation 
in the CFIR exen 11 sequence indicated in FIGURE 2 after subjecting a 
PCR amplification product thereof, obtained using the primers of 
FIGURE 3, to a method of mutation detection in accordance with the 
present invention, this method involving digestion of the PCR product 
that contains mutation mismatch heteroduplex molecules with a MutS 
mismatch-binding protein or repair enzyme, followed by degradation 
with a 3 1 to 5' exonuclease, size separation of the resulting DNA 
fragments by denaturing polyacrylamide gel electrophoresis, and then 
optically detecting the fluorescent dye label of relevant fragments, 
wherein (a) represents the spectrum obtained in relation to the FAM 
primer carrying the blue fluorescent label on one strand and (b) 
represents the spectrum obtained in relation to the JOE primer 
carrying the green fluorescent label on the complementary strand; and 
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FIGURES 5, 6 and 7 are each two part diagrams corresponding to 
FIGURE 4 but illustrating respectively the detection of other 
mutations S549N, G551D and R553X. 


5 DETAILED DESCRIPTION 


In FIGURE 1 a MutS mismatch-binding protein or repair enzyme 
is diagrammatically depicted bound to a mismatch region of a 
heteroduplex nucleic acid molecule which is suitably labelled at the 
10 5 1 end of at least one strand, and an enzyrre having a specific 3' to 
5» exonuclease activity is depicted on each strand while 
progressively degrading the latter from their 3* ends prior to a 
subsequent stage of analysis to resolve and separate the nucleic acid 
strand fragments according to size. 

15 

In the specific example to which FIGURES 2 to 7 of the 
drawings relate, the invention was tested by being used to detect 
contnonly occurring known mutations in exon 11 of the human cystic 
fibrosis transmembrane regulator (CFTR) gene which provides a test 
20 model for evaluating the method. 


A 492 base pair ENA amplification product of the CFTR gene was 
generated first by PCR using synthetic oligonucleotide primers 
(purchased from Applied Biosys terns) having the nucleotide sequences 

25 (SEQ ID HO: 2 and SEQ ID UD: 3) shown in FIGURE 3 and labelled with 
blue and green fluorescent dyes, these being designated FAM and JOE 
respectively. Using the above-mentioned fluorescent primers under 
standard PCR conditions, as described for example in the paper 
entitled "Primer-directed enzymatic amplif ication of DNA with a 

30 thermostable DNA polymerase" by Saiki et al (1988) Science 239, 487- 
91, using 58 °C annealing temperatures and a high fidelity Tag 
polymerase (Perkin-Elmer Ampitaq™ N801-0060) , thirty cycles of 
amplification were carried out from an initial sample of human 
genomic DNA made up of wild type reference DNA and target DNA in 

35 approximately equal amounts. The wild type reference DNA had a 
sequence corresponding to FIGURE 2 without nutations (standard DNA.) , 
whilst the target DNA being tested (obtained from human patients 
afflicted with cystic fibrosis) had a sequence corresponding to 
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FIGURE 2 tut containing one of the four different nutations indicated 
therein. This produced sanples of anplified ENA. labelled with EAM at 
one end and JOE at the other end of the molecule. 

The "standard" PCR conditions referred to above comprised 
denaturing for 5 minutes at 93 »C, then performing 30 cycles each 
ccnprising successive temperature levels of 95°C for 1 minute, 58°C 
for 1 minute and 72°C for 1 minute. Final extension was at 72 °C for 5 
minutes. Buffer was Tris (pH 8.4) lOnM, KC1 SOrrM, Mgd 2 l.SnM, dNTP 
200uM. Of the primers, 50 pmoles each were used in a 50^1 reaction 
mixture . Human genomic DMA. used was 50ng in a 50/il volume of 
reaction mixture. Tag polymerase was used at 1 unit per 50/xl 
reaction mixture . 

The PCR products were then purified by centrifugal 
microf iltration or dialysis using Centricon™ 100 tubes (Amicon) 
which removed excess primers and other reagents. In each case the 
PCR product (50^1) was diluted to 2ml in TE buffer, pH 8, and 
concentrated to a final volume of 25-30^1. The final wash used 
exonuclease buffer (50nM Ttis, pH 7.5, TuM MgCL 2 , SrtM DTT), and the 
ENA content of the purified PCR product was determined by tteasurenent 
of the optical density at 260nm (OD 260) . It may be noted that 
subsequent heteroduplex formation occurs better in the absence of Taq 
polymerase, so the Centricon step was included to remove this and 
other PCR reagents. Thus no new DNA synthesis occurs which might 
interfere with heteroduplex formation. 

The PCR product <20/tl to 30/d.) in lOOtrM NaCl, lOnM Tris (pH 
7.5) and UrM EDIA, was then heated to 95°C for five minutes, followed 
by cooling to 65 °C over a 60 minute period so as to promote the 
formation of heteroduplex molecules by hybridization of one strand of 
the non-mutant wild type DNA. with a strand of the test DNA that 
contains a mutation. This heteroduplex formation was also carried 
out in exonuclease buffer, i.e. 50mM Tris, pH 7.5? 7nM Mgd 2 , 5nM 
dithiothreitol. The reaction mixture containing the heteroduplex 
molecules thus formed was then either used inmediately or stored at 
-20°C until use. 
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2 pcnole (1-5 pmole ranges seem to work) of the DNA mixture 
containing the heteroduplex molecules produced (expected to be 50% 
heteroduplex, 50% hanoduplex) was then treated in the next stage with 
a preparation of the Mats protein to promote binding of the latter to 
5 the heteroduplex molecules at the site of mismatched bases. In more 
detail, 5 pmoles of MutS protein (purchased from United States 
Biochemicals Ltd. - catalogue number 71422 - and stored at -20°C 
until use) , 5/xl of dilution buffer (containing 50rrM Hepes, lOOrrM KC1, 
InM EDTA and InM UTT) was mixed and incubated on wet ice for 1 hour 

10 with the 2 pmoles of the DNA. in 15/d. of buffer (pH 7.5) containing 
SOrrM NaCi, lOrrM Tris-Cl, 7mM Mgd2 and InM EDTA made up to a total 
final volume 20/zl (subsequently, it has been found that a higher pH 
in the range of 8 to 8.5, preferably 8.5, can be beneficial, and that 
the Mg ++ concentration tray advantageously be increased to 8nM. Also, 

15 the MutS binding can be performed if desired at higher tenperatures, 
for exarrple at least up to about 30 °C) . After incubation on wet ice 
for 1 hour, 10 units of T7 polymerase (a processive exonuclease 
obtained from New England Biolabs - catalogue number 256S) were added 
and the tube was transferred to a 37°C waterbath. The actual amount 

20 of T7 polymerase used has varied in different experiments between 5 
and 10 units, but generally better signal-to-noise ratios have been 
seen using 10 units. After digestion for 3-5 minutes, this time 
feeing judged sufficient to ensure ccqolete degradation of all duplex 
DNA molecules free of mutations and therefore not bound to the MutS 

25 protein, the reaction was stepped fay the addition of lOpl gel loading 
buffer (deionised f ormamide containing 10oM EDIA) . The sarrples were 
heated to 90°C to denature the DNA and were then loaded, together 
with reference standards of known size (Applied Biosystems Rax 2500) , 
onto a polyacrylamide sequencing gel (6% concentration, containing 

30 7M Urea) for analysis. 

In this exanple the apparatus used for analysis was a model 
373 automated DNA sequencer of Applied Biosystems Inc. using Applied 
Biosystems 672 Genescanner™ software. This gave the results which 
35 are illustrated in FIGURES 4 to 7 and which are summarised in Tfcble 1 
at the end of the present description, fragment sizes being estimated 
using a third order least squares approximation. As will be seen frcm 
FIGURES 4 to 7, in this exanple distinct peaks were visible on both 
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strands which were absent in control samples, and a good signal to 
noise ratio was obtained although some small background peaks were 
present. 

The assay carried out substantially as described above has 
also been evaluated by testing other known nutations of the CFTR gene 
arid thus far it has been shown to be capable of detecting also the 
mutations G85E, R75X, R75Q and P67L in CFTR exon 3. 

It will of course be appreciated that in modifications of the 
method as described in the example above autoradiographic or other 
gel detection techniques could alternatively be used in carrying out 
the analysis provided they have sufficiently high resolution 
characteristics. Also, as already indicated, other size separation 
methods (particularly capillary electrophoresis, but also for exanple 
size exclusion chromatography, HPLC and thin layer chromatography) 
could readily be adapted to carry out the size separation process 
after the exonuclease treatment, if so desired. In addition, other 
labelling methods which could be used include the use of 
radiolabelled primers or PGR products, or instead of labelling the 
nucleic acid strands before digestion with the exonuclease enzyme 
silver staining or any other suitable DNA detection methods could be 
used to detect the GNA fragments after digestion and electrophoresis. 

the best excnuclease activity found so far has been seen with 
the exonuclease activity of T7 DNA. polymerase which is the presently 
preferred enzyme as previously mentioned, but other enzymes with 
processive, preferably unidirectional, exonuclease activity (either 
3' -5' or 5' -3') can also be suitable. In particular it may be found 
advantageous to use lambda exonuclease or Vent polymerase (Pol-ve) 
available from New England Biolabs. 

Although the Mats enzyme is presently the preferred mismatch- 
binding protein, other mismatch-binding proteins could also be used 
which may provide greater stability, including homologues of MutS, 
for exarrple the human MutS hcmologue hMSH2, or in some cases it may be 
found advantageous to use mixtures of Mut proteins or other mismatch- 
binding proteins. It may also be possible as another advantageous 
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alternative to use proteins which are not necessarily involved in 
repair of imitations but which nevertheless have a similar strong 
binding affinity specifically for base pair mismatch regions of 
nucleic acid molecules, for example the so-called Holiday junction 
binding RuvC protein from E. coli. If desired, genetic engineering 
techniques such as site directed mutagenesis may also be enployed to 
modify artificially MutS protein or homologues thereof so as, for 
example, to increase stability or impro v e sensitivity and binding 
characteristics. Also, in a further modification, stability of the 
heteroduplex molecules carrying the MutS or other mismatch-binding 
protein bound thereto may be improved by treatment with a chemical 
cross -linking agent, e.g. formaldehyde, before commencing the 
treatment with the exonuclease, thereby to cause the protein to be 
bound even more strongly for resisting and blocking the action of the 
exonuclease. 

15 

Various other modifications are of course also possible within 
the scope of the invention which includes all novel and inventive 
features and aspects herein disclosed, either explicitly or 
2Q implicitly and either singly or in cctrbination with one another. In 
particular, the scope of the invention is not to be construed as 
being limited by the illustrative examples or by the terms and 
expressions used herein merely in a descriptive or explanatory sense. 

25 
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Table 1 


Mutation 

Observed 
fragment 
size 
(blue) 

Distance from 
prister to 
mutation 
(blue) 

Observed 
fragment 
sixe 
(green) 

Distance 
from primer 
to mutation 
(green) 

wild type 

none 

HA 

none 

NA 

1717-1 G>A 

189 

184 

326 

307 

S549N 

253 

245 

263 

246 

GS51D 

259 

250 

256 

241 

R553X 

269 

256 

246 

235 
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SEQUENCE LISTING 


(1) GENERAL INFORMATION: 


(i) APPLICANT: 

(A) NAME: St. James's and Seacroft University Hospitals 

NHS Trust 

(B) STREET: Backet t Street 

(C) CITY; LEEDS 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP) ; LS9 7 TP 


(A) NAME: Graham Roy Taylor 

(B) STREET: IS Tinshill Drive 

(C) CITY: LEEDS 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP) : LS16 7DQ 

(ii) TITLE OF INVENTION: NUCLEIC ACID ASSAYS 


(iii) NUMBER OF SEQUENCES: 3 


(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentla Release #1.0, Version #1.30 (HPO) 

(Vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9408344.1 

(B) FILING DATE: 27 -APR- 19 94 


(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AXATACCCAT AAATATACAC ATATTTT A AT TTTTGGTATT TTATAATTAT TATTTAATGA 60 

TCATTCATGA CATTTTAAAA ATTACAGGAA AAATTTACAT CTAAAATTTC AGCAATGTTG 120 

TTTTTGACCA ACTAAATAAA TTGCATTTGA AATAATGGAG ATGCAATGTT CAAAATTTCA 180 

ACTGTGGTTA AAGCAATAGT GTGATATATG ATTACATTAG AAGGAAGATG TGCCTTTCAA 240 
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ATTCAGATTG AGCATACTAA AAGT6ACTCT CTAATTTTCT ATTTTTGGTA ATAGGACATC 300 

TCCAAGTTTG CAGAGAAAGA CAATATAGTT CTTGGAGAAG GTGGAATCAC ACTGAGTGGA 360 

GGTCAACGAC CAAGAATTTC TTTAGCAAGG TGAATAACTA ATTATTGGTC TAGCAAGCAT 420 

TTGCTGTAAA TGTCATTCAT GTAAAAAAAT TACAGACATT TCTCTATTGC TTTATATTCT 480 

GTTTCTGGAA TTGAAAAAAT CCTGGGGTTT TATGGCTAGT GGGTTAAGAA CACATTTAAG S40 

AACTATAAAT AATGGTATAG TATCCAGATT TGGTAGAGAT TATGGTTACT CAGAATCTGT 600 


10 


GCCCGTATCT TGG $ 13 


(2) INFORMATION FOR SEQ ID NO: 2s 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bate pairs 

(B) TYPE i nucleic acid 
15 (C) STRAND ED NESS t Eingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Oligonucleotide* 

20 SEQUENCE DESCRIPTION! SEQ ID NO: 2: 

AGCAATGTTG TTTTTGACCA ACTA 24 


25 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS i single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "Oligonucleotide" 


(xi) SEQUENCE DESCRIPTION r SEQ ID NO: 3: 
GCACAGATTC TGAGTACCCA TAAT 
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CLAIMS 

1. A method for detecting a mutation in a test sample of target 
nucleic acid, said method comprising 
5 a) providing a control sairple of reference nucleic acid 

capable of hybridizing to strands of the target nucleic 

acid from said test sample: 

b) denaturing said reference nucleic acid and said target 
nucleic acid, or replicates thereof, to an extent 

10 sufficient to form single- stranded reference nucleic 

acid and single- stranded target nucleic acid; 

c) annealing said single- stranded reference nucleic acid 
and said single- stranded target nucleic acid, wherein 
said' annealing is sufficient to form heteroduplex 

15 nucleic acid composed of one strand derived from said 

target nucleic acid hybridized with a second strand 
derived from said reference nucleic acid; 

d) contacting said heteroduplex nucleic acid with a 
mismatch-binding protein capable of binding selectively 

20 to the site of said mutation; 

e) treating said annealed nucleic acid material with an 
exonuclease, the presence of said mismatch-binding 
protein being sufficient to protect said nucleic acid to 
which it is bound from digestion by said exonuclease; 

25 and 

f ) detecting the presence of said protected nucleic acid as 
an indication of the presence of said nutation in said 
target nucleic acid. 

30 2. The method of Claim l, wherein subsequent to step (e) said 
method further ccnprises determining the length of residual strand 
fragments of said protected nucleic acid as an indication of the 
location of said mutation. 

35 3 . The nethod of Claim 2 wherein the determination of the length 
of said residual strand fragments of the protected nucleic acid is 
carried out by gel electrophoresis of denatured nucleic acid 
digestion products from step (e) whereby said residual strand 
fragments are separated according to their relative sizes. 
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4 . The method of Claim 2 wherein the method used for determining 
the length of said residual strand fragments of the protected nucleic 
acid is selected from capillary electrophoresis, size exclusion 
chromatography, high performance liquid chromatography (HPLC) and 
5 thin layer chromatography (TLC) . 


5 . A method for detecting single base pair mismatches or 
mutations in nucleic acid comprising the steps of 

(a) generating heteroduplex nucleic acid molecules in which 
single strands derived from a sanqple of the target nucleic 
acid under test are hybridized with corresponding non-mutant 
single strands derived from a reference nucleic acid sample 
whereby strands present in the original target nucleic acid 
test sampl6 become paired with non-mutant strands from the 
reference nucleic acid sanple to form mismatch- containing 
heteroduplex nucleic acid molecules; 

(b) contacting the nucleic acid material generated in step (a) 
with a mismatch-binding protein effective to seek out and 
selectively bind to mismatch sites in the heteroduplex 
molecules ; 

(c) digesting the nucleic acid material from step (b) with an 
enzyme having exonuclease activity whereby said enzyme 
progressively removes or deletes nucleotides from one end of 
each strand of the heteroduplex molecules until it encounters 
said mismatch-binding protein bcund to a mismatch site, said 
protein then acting to block further deletions and strand 
degradation by the exonuclease enzyme; and 

(d) analysing the nucleic acid digestion products from step (c) to 
determine the size or sizes of remaining single stranded 
nucleic acid fragments * 


6 . A method as claimed in Claim 5 wherein the size resolution and 
analysis operation of step (d) is carried out by gel electrophoresis 
of denatured acid digestion products from step (c) whereby the 
nucleic acid fragments are separated according to their relative 
size . 


7. 


A method as claimed in any of the preceding claims wherein a 
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detectable label is incorporated in said non-mutant strands derived 
from the reference nucleic acid. 

8 . A method as claimed in any of the preceding claims wherein the 
5 target nucleic acid and the reference nucleic acid are each 

subjected to PCR amplification in a preliminary stage before forming 
said heteroduplex nucleic acid molecules. 

9 . A method as claimed in any of the preceding claims wherein the 
10 exonuclease enzyme has specific unidirectional exonuclease activity. 

10 . A method as claimed in any of the preceding claims in which 
the mismatch-binding protein is MutS encoded protein, 

15 11. A method as claimed in any of Claims 1 to 9, in which the 
mismatch-binding protein is selected from HexA from Streptococcus 
jrcurmiae, MutS from E. coli, MutS from Salmonella typhiimirium, MutS 
from A. Vinlandii, MSH 1 from Saccharowyces cerevisiae, HMSH2 from 
humans, MMSH2 from noise, XMSH2 from Xencpas, or plSO from humans. 

20 

12 . A method as claimed in any of the preceding claims in which 
the reaction mixture containing the heteroduplex nucleic acid 
molecules is reacted with said mismatch-binding protein under 
slightly alkaline conditions and in the presence of a divalent 
25 cation. 

13 . A method as claimed in Claim 12 in which the slightly alkaline 
conditions are provided by a pH range of 8.0 to 8.5 and the divalent 
cation is Mg ++ at a concentration of at least 7nM (preferably about 
30 8mM) . 

14 . A method as claimed in any of the preceding claims in which 
the exonuclease activity is selected from the enzymes exonuclease 
III, T4 DNA polymerase, bacteriophage lambda exonuclease, Vent (Pol- 

35 ve) polymerase, and T7 DNA polymerase. 

15 . A kit comprising materials and reagents required for carrying 
out a method of detecting nucleic acid mutations in accordance with 
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one or more of the preceding claims, said kit including, in separate 
containers, an enzyme having exonuclease activity, a mismatch- 
binding protein (or proteins) capable of seeking out and selectively 
binding to base-pair mismatch sites in heteroduplex nucleic acid 
5 molecules and protecting said sites against digestion by said 
exonuclease enzyme, and appropriate buffer, together with 
instructions for use. 

16. The kit of Claim 15 which in addition contains reagents for 
10 carrying out a preliminary PGR anplification of a particular sequence 

of a target nucleic acid, said reagents including preselected 
primers, Taq polymerase enzyme, a mix of deaxynucleotides and buffers 
to provide the necessary reaction mixture for carrying out selective 
PCR amplification. 

15 

17 . A method for screening or testing human or animal subjects for 
the presence of a suspected genetic defect or nutated gene related 
disease, said method comprising obtaining nucleic acid from said 

2o subject and using that nucleic acid as the target nucleic acid in 
carrying out a method of detecting and localising a base mismatch or 
mutation in accordance with any of Claims 1 to 14. 


25 
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FIG. 1 


Mut S bound to 
heteroduplex 


Label 6 1 end 


•3* end 


Exonucteaso III 



^ end 
i'end {labelling possible) 


3* degraded by exonuclease until 
It encounters Mut S 


FIG . 3 

(a) FAM 5* AGC AAT GTT GTT TTT GAC CAA CTA 

(b) JOE 5' GCA CAG ATT CTG AGT ACC CAT AAT 
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FIG. 2 


ATATACCCAT AAATATACAC ATATTTTAAT TTTTGGTATT TTATAATTAT 50 

TATTTAATGA TCATTCATGA CATTTTAAAA ATTACAGGAA AAATTTACAT 100 
Blue Primer 

CTAAAATTTC AGCAATGTTG TTTTTCACCR ACTA AATAAA TTGCATTTGA 150 


AATAATGGAG ATGCAATGTT CAAAATTTCA ACTGTGGTTA AAGCAATAGT 200 
GTGATATATG ATTACATTAG AAGGAAGATG TGCCTTTCAA ATTCAGATTG 250 
AGCATACTAA AAGTGACTCT CTAATTTTCT ATTTTTGGTA ATAGGACATC 300 


TCCAAGTTTG CAGAGAAAGA CAATATAGTT CTTGGAGAAG GTGGAATCAC 350 

G551D(G>A) 
S549N(G>A) R553X(C>T) 

ACTGAGTGGA GGTCAACGAC CAAGAATTTC TTTAGCAAGG TGAATAACTA 400 


ATTATTGGTC TAGCAAGCAT TTGCTGTAAA TGTCATTCAT GTAAAAAAAT 450 


TACAGACATT TCTCTATTGC TTTATATTCT GTTTCTGGAA TTGAAAAAAT '500 

CCTGGGGTTT TATGGCTAGT GGGTTAAGAA CACATTTAAG AACTATAAAT 550 

Green primer (Reverse 
AATGGTATAG TATCCAGATT TGGTAGAGAT TATGGTTACT CAGAATCTGT 600 

Complement) 

GCCCGTATCT TGG 613 
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FIG.5. (a) (b) 
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